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Abstract 

In this paper we show that BPP is truth-table reducible to the set of Kolmogorov 
random strings Rk- It was previously known that PSPACE, and hence BPP is 
Turing-reducible to Rk- The earlier proof relied on the adaptivity of the Turing- 
reduction to find a Kolmogorov-random string of polynomial length using the set 
Rk as oracle. Our new non-adaptive result relies on a new fundamental fact about 
the set Rk, namely each initial segment of the characteristic sequence of Rk is 
not compressible by recursive means. As a partial converse to our claim we show 
that strings of high Kolmogorov-complexity when used as advice are not much more 
useful than randomly chosen strings. 

1 Introduction 

Kolmogorov complexity studies the amount of randomness in a string by the smallest 
program that can generate it. The most random strings are those we cannot compress 
at all making the set Rk = {x \ K(x) > \x\} of Kolmogorov random strings worthy of 
close analysis. 

Allender et al. [ABK+02] showed the surprising computational power of Rk includ- 
ing that polynomial time adaptive (Turing) access to Rk enables one to do PSPACE- 
computations: PSPACE C P Rk . One of the ingredients in the proof shows how on input 
n one can in polynomial time with adaptive access to Rk generate a polynomially long 
Kolmogorov random string. With non- adaptive access it is only possible to generate in 
polynomial time a random string of length at most O(logn). 
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In an attempt to characterize PSPACE as the class of sets reducible to Rk, Allender, 
Buhrman and Koucky [ABK06] noticed that this question depends on the choice of 
universal machine used in the definition of the notion of Kolmogorov complexity. They 
also started a systematic study of weaker and non-adaptive access to Rk- They showed 
for example that 

P = RECnf|{A| A<P m R Ku }. 
u 

This result and the fact that with non-adaptive access to Rk in general only logarithmi- 
cally small strings can be found seems to suggest that adaptive access to Rk is needed 
in order to be useful. 

Our first result proves this intuition false: We show that polynomial time non- 
adaptive access to Rk can be used to derandomize any BPP computation. In order to 
derandomize a BPP computation one needs a (pseudo) random string of polynomial size. 
As mentioned before one can only obtain short, 0(log n) sized, random strings from Rk- 
Instead we show that the characteristic sequence formed by the strings of length clogra, 
i?^ clogn , itself a strings of length n c , is complex enough to figure as a hard function in 
the hardness versus randomness framework of Impagliazzo and Wigderson [I W97] . This 
way we construct a pseudorandom generator that is strong enough to derandomize BPP. 

In particular we show that for every time bound t, there is a constant c such that 
Rk i.o.-DTIME(£)/2 n-c . This is in stark contrast with the time-unbounded case 
where only n bits of advice are necessary jBar68]. As a consequence we give an alter- 
native proof of the existence of an r.e. set A, due to Barzdin [Bar68], such that for all 
time bounds t, there exists q such that K^'^Ai-n \ n) > n/ct- We simply take for A 
the complement of Rk- Barzdin also showed that this lower bound is optimal for r.e. 
sets. Hence the constant depending on the time-bound in our Theorem [3] is optimal. 

Next we try to establish whether we can characterize BPP as the class of sets that 
non-adaptively reduce to Rk- One can view the truth-table reduction to Rk as a 
computation with advice of K t<yn ^ complexity Q(n). We can show that for sets in EXP 
and t(n) € 2 nQ1> , polynomial-time computation with polynomial (exponential, resp.) 
size advice of if*( n ) complexity n — O(logn) (n — O(loglogn), resp.) can be simulated 
by bounded error probabilistic machine with almost linear size advice. For paddablc 
sets that are complete for NP, P #P , PSPACE, or EXP we do not even need the linear 
size advice. Hence, advice of high K 1 ^ complexity is no better than a truly random 
string. 

Summarizing our results: 

• For every computable time bound t there is a constant c (depending on t) such 
that R K i.o.-DTIME(t)/2 n " c . 

• The complement of Rk is a natural example of an computably enumerable set 
whose characteristic sequence has high time bounded Kolmorogov complexity for 
every n. 
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• BPP is truth-table reducible to Rk. 

• A poly- up-to exponential-size advice that has very large K 1 ^ complexity can be 
replaced by 0(n log n) bit advice and true randomness. 

2 Preliminaries 

We remind the reader of some of the definitions we use. Let M be a Turing machine. 
For any string x G {0, 1}*, the Kolmogorov complexity of x relative to M is Km(x) = 
min{ \p\ | p G {0, 1}* &: M(p) = x}, where \p\ denotes the length of string p. It is 
well known that for a universal Turing machine U and any other machine M there is a 
constant cm such that for all strings x, Ku(x) < Km{x) + cm- For the rest of the paper 
we will fix some universal Turing machine U and we will measure Kolmogrov complexity 
relative to that U. Thus, we will not write the subscript U explicitly. 

We define K l (x) = min{ \p\ \ U{p) = x and U(p) uses at most t(\x\) steps}. Unlike 
traditional computational complexity the time bound is a function of the length of the 
output of U. 

A string x is said to be Kolmogorov-random if K(x) > \x\. The set of Kolmogorov- 
random strings is denoted by Rk = {x G {0,1}* | K(x) > \x\}. For an integer n and 
set A C {0, 1}*, A =n = A n {0, l} n . The following well known claim can be proven by 
considering the Kolmogorov complexity of |-R^ n | (see [LV08J). 

Proposition 1 There is a constant d such that for all n, \Rk u \ > 2 n /d. 

We also use computation with advice. We deviate slightly from the usual definition 
of computation with advice in the way how we express and measure the running time. 
For an advice function a : N — > {0, 1}*, we say that L G P/a if there is a Turing machine 
M such that for every x € {0,1}*, M(x,a(\x\)) runs in time polynomial in the length 
of x and M(x,a(\x\)) accepts iff x 6 L. We assume that M has random access to its 
input so the length of a(n) can grow faster than any polynomial in n. Similarly, we 
define EXP /a where we allow the machine M to run in exponential time in length of x 
on the input (x,a(|x|)). Furthermore, we are interested not only in Boolean languages 
(decision problems) but also in functions, so we naturally extend both definitions also 
to computation with advice of functions. Typically we are interested in the amount of 
advice that we need for inputs of length n so for / : N — >N,C//is the union of all 
C/a for a satisfying |a(n)| < f(n). 

Let L be a language and C be a language class. We say that L G i.o.—C if there 
exists a language L' G C such that for infinitely many n, L =n = L' =n . For a Turing 
machine M, we say L G i.o.-M// if there is some advice function a with |a(n)| < f(n) 
such that for infinitely many n, L =n = {x G S n | M(x,a(\x\)) accepts}. 

We say that a set A polynomial-time Turing reduces to a set B, if there is an oracle 
machine M that on input x runs in polynomial time and with oracle B decides whether 
x G A. If M asks its questions non-adaptively, i.e., each oracle question does not depend 
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on the answers to the previous oracle questions, we say that A polynomial-time truth- 
table reduces to B (A<^ t B). Moreover, A<^ tt B if machine M outputs as its answer 
the disjunction of the oracle answers. Similarly, A <£tt B for the conjunction of the 
answers. 

3 High circuit complexity of Rk 

In this section we prove that the characteristic sequence of Rk has high circuit com- 
plexity almost everywhere. We will first prove the following lemma. 

Lemma 2 For every total Turing machine M there is a constant cm such that Rk is 
not in \.o.-M/2 n ~ CM . 

There is a (non-total) Turing machine M such that Rk is in M/n + 1 where the 
advice is the number of strings in R K n - Simply find all the non-random strings of length 
n. This machine will fail to halt if the advice underestimates the number of random 
strings. 

Proof of Lemma\^ Suppose the theorem is false. Fix a total machine M. We have that, 
(x, a) £ L(M) if and only if x £ Rk, for some advice a of length k < 2 n ~ CM and every 
x of some large enough length n. By padding the advice we can assume k = 2 n ~ CM . We 
will set cm later in order to get a contradiction. 

Let Rr = {x £ S n | (x,(3) £ L(M)}. By Proposition [1] for some constant d, 
\R a \ > 2 n /d so we know that if \Rp\ < 2 n /d then ^ a. We call (3 good if \R P \ > 2 n /d. 

Fix a good ft and choose x%, . . . , x m at random. The probability that all the Xi are 
not in Ra is at most (1 — l/d) m < 2~ m l d . There are 2 k advice strings of length k so 
if 2~ m / d < 2~ k then there is a sequence x\, . . . , x m such that for every good f3 of length 
k there is an i such that Xi £ Rp. 

We can computably search all such sequences so let x±, . . . , x m be the lexicographi- 
cally least sequence such that for each good j3 of length k, there is some Xi £ Rp. This 
also means xi € R a for some i so for one of the Xi we have K{xi) > n. 

Fix m = 2 n ~ a for a constant a to be chosen later. 

We can describe Xi by n — a + 6 log a bits for some constant b: n — a bits to describe 
i, O(loga) bits to recover n and a constant number of additional bits to describe k, M, 
d and the algorithm above for finding xi, . . . , x m . If we pick a such that a > b log a we 
contradict the fact that K(xi) > n. 

If we pick cm > a + logd we then have 2 n ~ a > 2 n ~ CM d, m > kd and 2~ rn / d < 2~ k 
completing our contradiction. □ 

In order to get our statement about time bounded advice classes we instantiate 
Lemma [2] with universal machines Ut that run in time t, use the first part of their 
advice, in prefix free form, as a code for a machine that runs in time t and has the 
second part of the advice for Ut as its advice. The following is a direct consequence of 
Lemma [2 
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Lemma 3 For every computable time bound t and universal advice machine Ut there is 
a constant a such that Rk is not in i.o.-Ut/2 n_c * . 

We are now ready to prove the main theorem from this section. 

Theorem 4 For every computable time bound t there is a constant dt such that Rk is 
not in i.o.-DTIME(t)/2 n - dt . 

Proof. Suppose the theorem is false, that is there is a time bound t such that for 
every d there is a machine that runs in time t such that R^ £ i.o.-M c ;/2 n_a! . Set 
t' = t log t and let q/ be the constant that comes out of Lemma [3] when instantiated 
with time bound t'. Set d = cy + 1 and let the code of machine from the (false) 
assumption have size e. So we have that R^ £ i.o.-Md/2 n ~ d . This in turn implies that 
Rk £ i.o.-LV/2 n ~ d + e + 21oge, which implies that Rk £ i.o.-Ut'/2 n ~°*' a contradiction 
with Lemma [3j The last step is true because the universal machine running for at most 
time tf = t\ogt, can simulate Mj, who runs in time t. 

□ 

As an immediate corollary we get an alternative, more natural candidate for Barzdin's 
computably enumerable set that has high resource bounded Kolomorov complexity, 
namely the set of compressible strings. 

Corollary 5 For every computable time boundt there is a constant c such that : 
n) | n) > n/c 

Barzdin [Bar68] also showed that this lower bound is optimal. That is the dependence 
of c on the time bound t is needed for the characteristic sequence of every r.e. set. Hence 
this depence is also necessary in our Theorem HJ 

4 BPP truth-table reduces to Rk 

In this section we investigate what languages are reducible to Rk- We start with the 
following theorem which one can prove using nowadays standard derandomization tech- 
niques. 

Theorem 6 Let a : {0}* — > {0, 1}* be a length preserving function and 5 > be a 
constant. If a(0 n ) i.o.-EXP/n 5 then for every A E BPP there exists d > such that 
A G P/a(0 nd ). 

Proof. a(0 n ) i.o.-EXP/n 5 implies that when a(0 n ) is interpreted as a truth-table of a 
function f a (on) : {0, l} lo § n — > {0, 1}, / a (o») does not have boolean circuits of size n 5 / 3 for 
all n large enough. It is known that such a function can be used to build the Impagliazzo- 
Wigderson pseudorandom generator [IW97j which can be used to derandomize boolean 
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circuits of size n 5 ' for some 5' > (see [I W971 lKvM991 |ABK + Q2] ) . Hence, bounded-error 
probabilistic computation running in time r/ can be derandomized in polynomial time 
given access to a(0 n2l/S ). □ 

From Theorem |4] and the above Theorem we obtain the following corollary. 
Corollary 7 BPP <£ R K . 

Proof. Let a(0 n ) be the truth-table of Rk on strings of length [lognj padded by zeros 
to the length of n. By Theorem HI a(0 n ) G" i.o.-EXP/(n/c) for some c > 0. Consider 
any A G BPP. By Theorem [6] for some d, A G P/a(0 n ). The claim follows by noting 
that a truth-table reduction to Rk may query the membership of all the strings of length 
[logn d J to construct a(0 n ) and then run the P/a(0 n ) algorithm for A. □ 

Our goal would be to show that using Rk as a source of randomness is the only way 
to make use of it. Ideally we would like to show that any recursive set that is truth-table 
reducible to Rk must be in BPP. We fall short of such a goal. However we can show 
the following claim. 

Theorem 8 Let a : {0}* — > {0, 1}* be a length preserving function and c > be a 
constant. If a(0 n ) £ i.o.-EXP/n — clogn then for every A G EXP if A G P/a(0 n ) for 
some d > then A G BPP/0(ralogn). 

This theorem says that Kolmogorov random advice of polynomial size can be replaced 
by almost linear size advice and true randomness. We come short of proving a converse 
of the above corollary in two respects. First, the advice is supposed to model the 
initial segment of the characteristic sequence of Rk which the truth-table can access. 
However, by providing only polynomial size advice we restrict the hypothetical truth- 
table reduction to query strings of only logarithmic length. Second, the randomness 
that we require from the initial segment is much stronger than what one can prove and 
what is in fact true for the initial segment of the characteristic sequence of Rk- One 
can deal with the first issue as is shown by Theorem [9] but we do not know how to deal 
with the second one. 

Proof. Let M be a polynomial time Turing machine and A G EXP be a set such that 
A(x) = M(x,a(\x\ d )). We claim that for all n large enough there is a non-negligible 
fraction of advice strings r of size n d that could be used in place of a(n d ) more precisely: 

Pr [Vx,x G A <J=^ M(x,r) = 1] > 
re{o,i} nd nC 

To prove the claim consider the set G = {r G {0, l} nd ; G {0, l} n ,x G A 
M(x,r) = 1}. Clearly, G G EXP and a(0 nd ) G G. If |G =nd | < 2 nd /n cd then a(0" d ) can 
be computed in exponential time from its index in the set G =n of length n d — cdlogn. 
Since a(0 n ) g" i.o.-EXP/n d — cdlogn this cannot happen infinitely often. 
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Now we present an algorithm that on input x samples from G using only 0(n log n) 
bits of advice (in fact O(logn) entries from the truth table of A) and outputs A(x) with 
high probability. Consider the following algorithm: 

1. Given an input x of length n, and an advice string x\,A{x\), Xk, A{xk), 

2. sample at most 2n cd strings of length n d until the first string r is found such that 
M(xi,r) = A(xi) for all i € {1, . . . , k}. 

3. If we find r consistent with the advice then output M(x, r) otherwise output 0. 

For all n large enough the probability that the second step does not find r compatible 
with the advice is upper-bounded by the probability that we do not sample any string 
from G which is at most (1 — ^d) 2nC < e~ 2 < 1/6. 

It suffices to show that we can find an advice sequence such that for at least 5/6- 
fraction of the r's compatible with the advice M(x, r) = A(x). For given n, we will find 
the advice by prnning iteratively the set of bad random strings B = {0, l} n \G. Let 
i = 0, 1, ... , 2cdlog 6/ / 5 n. Set Bq = B. If there is a string x G {0, 1}™ such that for at 

least 1/6 of r G B i} M(x,r) / A(x), then set Xi+i = x and B i+ i = Bi n {r G {0, l}" d \ 
M(xi + %,r) = A(x{ + i)}. If there is no such string x then stop and the x^s obtained 
so far will form our advice. Notice, if we stop for some i < 2cdlog 6 / 5 n then for all 
x £ {0, 1}™, PT r( zQ i [M(x,r) ^ A(x)] < 1/6. Hence, any r found by the algorithm to be 
compatible with the advice will give the correct answer for a given input with probability 
at least 5/6. On the other hand, if we stop building the advice at i = 2cdlog 6/ / 5 n then 

l^3cdiog 6/5 n| < 2™ d • (5/6) 2cdlog6 / 5 ™ < \G= nd \/n cd . Hence, any string r found by the 
algorithm to be compatible with the advice xx,A(x\), A(x{) will come from G 
with good probability, i.e., with probability > 5/6 for n large enough. □ 

The following theorem can be established by a similar argument. It again relies on 
the fact that a polynomially large fraction of all advice strings of length 2 n must work 
well as an advice. By a pruning procedure similar to the proof of Theorem [8] we can 
avoid bad advice. In the BPP algorithm one does not have to explicitly guess the whole 
advice but only the part relevant to the pruning advice and to the current input. 

Theorem 9 Let a : {0}* — > {0, 1}* be a length preserving function and c > be a 

constant. If a(0 n ) £ i.o.-EXP/n — cloglogn then for every A G EXP if A G P/a(0 ) 
for some d > then A G BPP/0(nlogn). 

We show next that if the set A has some suitable properties we can dispense with 
the linear advice all together and replace it with only random bits. Thus for example if 
SAT G P/a(0 ri ) for some computationally hard advice a(0 n ) then SAT G BPP. 

Theorem 10 Let a : {0}* — > {0, 1}* be a length preserving function and c > be a 
constant such that a(0 n ) $ i.o.-EXP/n — clogra. Let A be paddable and polynomial-time 
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many-one-complete for a class C G {NP, P #P , PSPACE, EXP}. If A € V/a{Q n ) for 
some d > then A G BPP (and hence C C BPP J. 

To prove the theorem we will need the notion of instance checkers. We use the 
definition of Trevisan and Vadhan |TV02] . 

Definition 11 An instance checker C for a boolean function f is a polynomial-time 
probabilistic oracle machine whose output is in {0,1, fail} such that 

• for all inputs x, Pr[C^ (x) = f(x)] = 1, and 

• for all inputs x, and all oracles f , Pr[C^'(x) G" {/(x),fail}] < 1/4. 

It is immediate that by linearly many repetitions and taking the majority answer 
one can reduce the error of an instance checker to 2~ n . Vadhan and Trevisan also state 
the following claim: 

Theorem 12 ( [ BFL91) . |LFKN92|, ISha92j ) Every problem that is complete /or EXP , 
PSPACE or P#P has an instance checker. Moreover, there are EXP-complete problems, 
PSPACE- complete problems, and P#^ -complete problems for which the instance checker 
C only makes oracle queries of length exactly i(n) on inputs of length n for some poly- 
nomial £(n). 

However, it is not known whether NP has instance checkers. 
Proof of Theorem \1(A To prove the claim for P*^*-, PSPACE- and EXP-complete 
problems we use the instance checkers. We use the same notation as in the proof of 
Theorem El i.e., M is a Turing machine such that A(x) = M(x, a(\x\ d )) and the set of 
good advice is G = {r G {0, l} nd ; Vx G {0, l} n ,x G A <J=^ M(x,r) = 1}. We know 
from the previous proof that \G =n \ > 2 n /n cd because a(0 n ) i.o.-EXP/n — clogn. 

Let C be the instance checker for A which on input of length n asks oracle queries 
of length only £(n) and makes error on a wrong oracle at most 2~ n . The following 
algorithm is a bounded error polynomial time algorithm for A: 

1. On input x of length n, repeat 2n cd times 

(a) Pick a random string r of length (l(n)) d . 

(b) Run the instance checker C on input x and answer each of his oracle queries 
V by M(y,r). 

(c) If C outputs fail continue with another iteration otherwise output the output 
of C. 

2. Output 0. 



S 



Clearly, if we sample r € G then the instance checker will provide a correct answer 
and we stop. The algorithm can produce a wrong answer either if the instance checker 
always fails (so we never sample r € G during the iterations) or if the instance checker 
gives a wrong answer. Probability of not sampling good r is at most 1/6. The probability 
of getting a wrong answer from the instance checker in any of the iterations is at most 
2n cd /2 n . Thus the algorithm provides the correct answer with probability at least 2/3. 

To prove the claim for NP-complete languages we show it for the canonical example 
of SAT. The following algorithm solves SAT correctly with probability at least 5/6: 

1. On input (ft of length n, repeat 2n cd times 

(a) Pick a random string r of length n d . 

(b) If M((ft,r) = 1 then use the self-reducibility of SAT to find a presumably 
satisfying assignment a of (ft while asking queries ift of size n and answering 
them according to M(ift, r). If the assignment a indeed satisfies (ft then output 
1 otherwise continue with another iteration. 

2. Output 0. 

Clearly, if (ft is satisfiable we will answer 1 with probability at least 5/6. If is not 
satisfiable we will always answer 0. □ 

5 Open Problems 

We have shown that the set Rk cannot be compressed using a computable algorithm 
and used this fact to reduce BPP non-adaptively to Rk- We conjecture that every 
computable set that non-adaptively reduces in polynomial-time to Rk sits in BPP and 
have shown a number of partial results in that directions. 

The classification of languages that polynomial-time adaptively reduce to Rk also 
remains open. Can we characterize PSPACE this way? 

References 

[ABK+02] E. Allender, H. Buhrman, M. Koucky, D. van Melkebeek, and D. Ronneb- 
urger. Power from random strings. In Proceedings of the 43th Annual Sym- 
posium on Foundations of Computer Science, pages 669-678, 2002. 

[ABK06] E. Allender, H. Buhrman, and M. Koucky. What can be efficiently reduced 
to the K-random strings? Annals of Pure and Applied Logic, 138:2-19, 2006. 

[Bar68] J. Barzdin. Complexity of programs to determine whether natural numbers 
not greater than n belong to a recursively enumerable set. Soviet Mathemat- 
ics Doklady, 9:12511254, 1968. 



9 



[BFL91] L. Babai, L. Fortnow, and C. Lund. Non-deterministic exponential time has 
two-prover interactive protocols. Computational Complexity, 1:3-40, 1991. 



[IW97] R. Impagliazzo and A. Wigderson. P = BPP if E requires exponential 
circuits: Derandomizing the XOR lemma. In Proceedings of the 29th Annual 
ACM Symposium on Theory of Computing, pages 220-229, 1997. 

[KvM99] A. Klivans and D. van Melkebeek. Graph nonisomorphism has subexponen- 
tial size proofs unless the polynomial-time hierarchy collapses. In Proceedings 
of the 31st Annual ACM Symposium on Theory of Computing, pages 659- 
667, 1999. 

[LFKN92] C. Lund, L. Fortnow, H. Karloff, and N. Nisan. Algebraic methods for 
interactive proof systems. Journal of ACM, 39:859-868, 1992. 

[LV08] M. Li and P. Vitanyi. An Introduction to Kolmogorov Complexity and Its 
Applications. Graduate Texts in Computer Science. Springer, New York, 
third edition, 2008. 

[Sha92] A. Shamir. IP = PSPACE. Journal of ACM, 39:869-877, 1992. 

[TV02] L. Trevisan and S. Vadhan. Pseudorandomncss and average-case complexity 
via uniform reductions. In Proceedings of the 17th Computational Complexity 
Conference, pages 129-138, 2002. 



10 



